Model Quantization, ONNX Runtime, Embedded Inference, TinyML

ML Systems Textbook by Harvard
mlsysbook.ai·5h·
Discuss: Hacker News
🚀MLOps
OpenAI experiment finds that sparse models could give AI builders the tools to debug neural networks
venturebeat.com·2d
💬Prompt Engineering
Attention really is all you need — The Encoder
pub.towardsai.net·1h
🤖Transformers
Adaptive AI: Making Edge Inference Smart and Fast by Arvind Sundararajan
dev.to·5d·
Discuss: DEV
🔥PyTorch
Generative AI and the P=NP problem
lesswrong.com·14h
🧮SMT Solvers
GNN From Scratch
cultured-avenue-f13.notion.site·7h·
Discuss: r/programming
🕸️GraphBLAS
Selective (smart) MoE experts offloading to CPU?
arxiv.org·4d·
Discuss: r/LocalLLaMA
🧩mimalloc
Shattering the Illusion: Maker Achieves Million-Step, Zero-Error LLM Reasoning
cognizant.com·8h·
Discuss: Hacker News
🎭Program Synthesis
Reauthoring and Converting models for edge inference: MambaV2 on LiteRT
sachinjoglekar.substack.com·17h·
Discuss: Substack
🔥PyTorch
Understanding neural networks through sparse circuits
openai.com·2d·
Discuss: Hacker News
📝Parser Combinators
DenoGrad: Deep Gradient Denoising Framework for Enhancing the Performance of Interpretable AI Models
arxiv.org·2d
🔬Deep Learning
EyesOff: I Built a Screen Contact Detection Model
ym2132.github.io·22h·
Discuss: Hacker News
👁️Computer Vision
Building a simple RAG system in PHP with the Neuron AI framework in one evening
neuron-ai.dev·10h·
Discuss: DEV
💬Prompt Engineering
Goodbye *ibe Coding
ikouchiha47.github.io·6h·
Discuss: Hacker News
💬Prompt Engineering
A Deep Dive into Self-Attention and Multi-Head Attention in Transformers
medium.com·18h·
Discuss: r/LocalLLaMA
🤖Transformers
From ETL to AI(e)tl: Rethinking Data Pipelines for the AI Era
evanvolgas.substack.com·1d·
Discuss: Substack
🦙Ollama
I Measured Neural Network Training Every 5 Steps for 10,000 Iterations
towardsdatascience.com·16h
🎯Reinforcement Learning
Teaching AI to see the world more like we do
deepmind.google·3h·
Discuss: Hacker News
👁️Computer Vision
NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning
paperium.net·2d·
Discuss: DEV
QuestDB
Scheduling in LLM Inference
fergusfinn.com·1d·
Discuss: Hacker News
🐍Python